Rule-based Categorial Analysis of Unprompted Speech - a Cross-language Study

Author

  • N. Beringer
Abstract

In this study, we investigated how language-specific properties in a cross-language task influence automatic segmentation with a self-learning algorithm that integrates pronunciation rules. The goal of this paper is to present the linguistic and statistical results of a new method for automatically generating pronunciation rules for the automatic segmentation of speech: the German MAUSER system. MAUSER is an algorithm that generates pronunciation rules independently of any domain-dependent training data, either by clustering and statistically weighting self-learned rules according to a small set of phonological rules clustered by categories, or by re-weighting “seen” phonological rules. With this method, we can cost-effectively segment large corpora of mainly unprompted speech automatically.

Similar resources

Independent automatic segmentation by self-learning categorial pronunciation rules

The goal of this paper is to present a new method for automatically generating pronunciation rules for the automatic segmentation of speech: the German MAUSER system. MAUSER is an algorithm that generates pronunciation rules independently of any domain-dependent training data, either by clustering and statistically weighting self-learned rules according to a small set of phonological rules clustered by...

An Analysis of speeches of Hussein ibn Ali (AS) in the first step toward the incident of Karbala (Departing Medina to Mecca) based on John Searle’s Speech Acts

Linguistic theories can open new doors to historical analysis. This paper seeks to analyze the speeches of Hussein ibn Ali in the first step toward the incident of Karbala, which was his departure from Medina to Mecca. The Speech Acts theory, which is rooted in Discourse Analysis, focuses on the role of language. It sees speech as an act that brings about actions in this world. Searle introduces only...

Categorial grammars used to partial parsing of spoken language

Spoken language understanding is a challenge for the development of Spoken Dialogue Systems. Recognition errors and speech repairs make it impossible to get complete syntactic analysis. Shallow parsing and chunking seem to be efficient in order to start both a robust and precise analysis. This paper describes experiments made with Logus, a spoken understanding system based on incremental methol...

A comparative sociopragmatic analysis of wedding invitations in American and Iranian societies and teaching implications

Wedding invitations (WIs), as a uniquely socially and culturally constructed genre, provide a distinct opportunity to compare the sociocultural values of different speech communities as reflected in the textual content and organization of the different moves. Students can be exposed to this genre and its different moves using a genre-based pedagogy. Genre-based ped...

An Inference-rules based Categorial Grammar Learner for Simulating Language Acquisition

We propose an unsupervised inference rules-based categorial grammar learning method, which aims to simulate language acquisition. The learner has been trained and tested on an artificial language fragment that contains both ambiguity and recursion. We demonstrate that the learner has 100% coverage with respect to the target grammar using a relatively small set of initial assumptions. We also sh...

Publication date: 2003